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Abstract 

We examine directed spanners through flow-based linear programming relaxations. We de- 
sign an 0(n 2 / 3 )-approximation algorithm for the directed fc-spanner problem that works for all 
fc > 1, which is the first sublinear approximation for arbitrary edge-lengths. Even in the more 
restricted setting of unit edge-lengths, our algorithm improves over the previous 0(n 1_1 / fc ) ap- 
proximation BGJ + 09| | when k > 4. For the special case of k = 3 we design a different algorithm 



achieving an 0( v /n)-approximation, improving the previous 0(n 2 / 3 ) [EP05, BGJ + 09{ . Both of 



our algorithms easily extend to the fault-tolerant setting, which has recently attracted attention 
but not from an approximation viewpoint. We also prove a nearly matching integrality gap of 
Jl(ri3" e ) for any constant e > 0. 

A virtue of all our algorithms is that they are relatively simple. Technically, we introduce 
a new yet natural flow-based relaxation, and show how to approximately solve it even when its 
size is not polynomial. The main challenge is to design a rounding scheme that "coordinates" 
the choices of flow-paths between the many demand pairs while using few edges overall. We 
achieve this, roughly speaking, by randomization at the level of vertices. 
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1 Introduction 



We examine several directed spanner problems from the perspective of approximation via a linear 
programming (LP) relaxation. In particular, we design for these classical NP-hard problems flow- 
based LP relaxations, and then investigate how well these relaxations approximate the optimal 
spanner, providing nearly matching upper and lower bounds. We begin by introducing the spanner 
problems that we consider, focusing throughout on directed graphs; we briefly compare to undirected 



graphs in Section 1.4 



1.1 Spanner Problems 

Let G = (V,E) be a a strongly connected directed graph.[] A k-spanner of G, for k > 1, is a 
subgraph G' = (V,E'), that preserves all pairwise distances within factor k, i.e. for all u,v G V, 

d G <(u,v) < k ■ d G (u,v). (1) 

Here and throughout, du denotes the shortest-path distance in a graph H. It is easy to see that 
requiring (|]) only for edges (u, v) 6 E suffices. 

In the directed /c-spanner problem with unit lengths, the input is the graph G, and the goal is to 
find a A;-spanner G' having the minimum number of edges. We allow the stretch k to be a function of 
n = \V\, e.g. k = O(logn), and in fact some of our results are most interesting when k = f2(logn). 
This definition was introduced by Peleg and Schaffer [ PS89f| (in particular, they showed the problem 



is NP-hard), and since then it has been studied extensively, with applications ranging from routing 
in networks (e.g. RAP95| , |TZ05l ) to solving linear systems (e.g. |ST04| , |EEST08|] ). 



The above definition has several natural generalizations. An obvious one is to let G have 
nonnegative edge-lengths, leading to more complicated distances. This is the directed £;-spanner 



problem with arbitrary edge-lengths. Another generalization, introduced in [CLPR09], incorporates 
fault-tolerance: a A;-spanner G' is r -vertex-tolerant if for all F C V with \F\ < r we have that G'\F 
is a fc-spanner of G \ F. The definition of r -edge-tolerant is the same, except that F is a subset of 
E rather than of V. Clearly, the special case r = is just the standard notion defined above. This 
paper address both of these generalizations. 



Yet another generalization of the problem is the client-server model [EP01]: the input contains 



also a set C C E of so-called client edges and a set S C E of server edges, the requirement (|T]) 
is only needed for edges in C, while the spanner is only allowed to use edges in S (i.e. E' C S). 
Obviously, the case C = S = E is just the standard notion defined above. Our results extend to this 
model in a straightforward manner, but for the sake of exposition, we shall not address it directly. 

1.2 Results 

We first present a flow-based LP relaxation for spanner problems (Section 0). This relaxation is 



quite natural but appears to be new, and in particular it differs from the ones used in [DK99 



|BGJ + 09 |. We then use this LP relaxation to obtain the approximation algorithms described below 



(see also Table |l|) 



1 The assumption of strong connectivity is for notational convenience, although the definitions and all our results 
extend easily to all digraphs. 
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General stretch k. Our first algorithmic result is an 0(n 2 / 3 )-approximation for the directed 
£;-spanner problem that works for all k > 1, even with arbitrary edge-lengths (Section ||). This 
is the first approximation algorithm that handles the more general case of arbitrary edge-lengths. 
And even for unit edge-lengths, it improves over the previously known 0(n 1-1 / fc )-approximation for 
general k > 3 due to Bhattacharyya, Grigorescu, Jung, Raskhodnikova, and Woodruff [ BGJ + 09 [. 



Thus our result shows that the approximation need not increase with k, and provides the first 
sublinear (in n) approximation ratio for k > logn. The cases k = 2,3 are addressed separately 
below. Furthermore, using the reduction of BGJ + 0Sj1 from Transitive-Closure fc-spanner to di- 



rected /c-spanner we obtain for the former problem an 0(n 2 / 3 )-approximation, improving over their 
0(min{n 1_1 / fe , n//c 2 })-approximation for all k <C n 1 / 6 . 

We complement the above algorithmic result by showing that our (rather natural) LP relax- 
ation has an integrality gap of J7(^n 1 / 3_e ) for every constant e > 0, even in the unit-length case 
(Section |3.2j ). Previously, Elkin and Peleg | JEP07 ] proved that for every fixed < e,5 < 1 and 



3 < k = o(n s ), approximating the directed fc-spanner with unit edge-lengths problem within ra- 
tio 2 log n is quasi-NP-hard (similar hardness results were already known for smaller ranges of 



k Kor01 |). We conclude that a polynomial approximation (independent of k) is probably the best 



one can hope for, and specifically the best possible exponent appears to be in the range [1/3, 2/3]. 

Stretch k = 3. For directed 3-spanner with unit-length edges we achieve an even better 0(y/n)- 
approximation (Section |j). Notice that this approximation factor matches, up to lower order factors, 
the 0(y / n)-approximation known for undirected graphs (an immediate consequence of the absolute 
guarantee of [ ADD + 93 | that every undirected graph has a fc-spanner with 0(n 1+2 ^ k+1 ^) edges) 



The previous approximation known for this case is 0(n 2//3 ), first proved by Elkin an d Peleg [EP05 |. 



A similar approximation can be obtained also by the aforementioned algorithm of | BGJ + 09| 1, and 
by our first algorithm mentioned above. 

Stretch k = 2. This case (directed 2-spanner with unit-length edges) is rather exceptional and 
is known to have tight approximation bounds: O(logn) approximation [ |KP94 . EP01 and r2(log 



n) 



NP-hardness [ Kor01[| .p| We show similar bounds on the integrality gap of our LP relaxation (in 



Section |j|) , a finding that is not very surprising but affirms the strong connection between our LP 
relaxation and the approximability threshold. 

Fault-tolerant spanners. We also adapt our algorithms to the fault-tolerant setting, albeit 



restricted to unit length edges (see Sections |3j] and -LI ) . The fault-tolerant setting is significantly 



more complicated: the LP relaxation might have an exponential number of both variables and 



constraints (see Section |2.1| ), and we must resort to bicriteria approximations when the number 
of faults r is not constant. Generally speaking, the approximation factors we obtain grow with r 
like k r in the first algorithm (for general k), and polynomially in r in the second algorithm (for 
k = 3 with unit-length edges). These are the first results for fault-tolerant spanners in directed 
graphs. For undirected graphs, absolute guarantees (i.e. not as approximation factors) are known 



flCLPR09 |, and these bounds also grow like k r when vertices fail, even for the k = 3 case. 



It is possible to refine the approximation in terms of the graph's average/maximum degree. 
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Directed /c-Spanner with unit edge-lengths 



Stretch 


Our Approximation 


Previous Approximation 


Integrality gap 


k > 4 


0(n 2 / 3 ) Thm p| 


Oin 1 - 1 ^) 1BGJ+09|J 


^(i. n i/3- £ ) Thm 


3j 


k = 3 


O^ 1 / 2 ) Thm|4.3| 


0(n^' 6 ) [EP05, IBGJ+09I1 


n( n i/3-e) Xhm 


3^ 


k = 2 


O(logn) Thm|5.2| 


O(logn) |KP94, EP01|] 


fi(logn) Thm 


y 



Similarly but with r (vertex/edge) fault-tolerance 



Stretch 


Our Approximation 


Previous Approximation 


k>A 


( ^ i0(( (l + e)K fc + : ) r nlnn )2/3)) ^ 




k = 3 


(sTO' ^ 172 )) Thm 4.4 




fc = 2 


0(r log n) Thm 5.5 





Table 1: Summary of our approximation results compared with previous work 



Note: Shortly after completing the initial version of this paper we became aware of a preprint 
by Berman, Raskhodnikova, and Ruan [BRR10], that, independently of our work, constructs an 



0(k ■ n 1-1// r fc / 2 l log n) approximation for directed unit-length fc-spanner. For k = 3 this gives 
essentially the same bound as our 0(y / n)-approximation (up to logarithmic factors), and for k = 4 
their 0(^/nlogn) ratio is a polynomial improvement over our 0((n log n) 2 / 3 ) approximation. For 
k = 5, 6 the approximation ratio of their algorithm becomes 0(n 2 / 3 log n), basically matching 
ours. Their techniques are not based on linear programming; instead, they first prove that every 
valid spanner can be covered by generalized stars using few edges, and then design a set-cover-like 
approximation algorithm for the minimum-size generalized-star cover problem. While their ratio is 
better than the previous 0(n 1-1 / fc )-approximation [BGJ + 09|, it still only applies to the unit-length 



setting and degrades with k so as to give a nontrivial bound only when k < O(logn), while our 
0(n 2 / 3 )-approximation suffers from neither of these limitations. 

1.3 Techniques 

All of our approximation algorithms rely on solving the LP relaxation and rounding the resulting 
"fractional" solution. In some cases the LP relaxation does not have polynomial size, but we can 
solve it within a reasonable approximation in polynomial time by reducing it (via duality and the 
ellipsoid algorithm) to a problem known in the literature as Restricted Shortest Path (Theorems 



2.1| and p.2|) . A virtue of all our algorithms is that they are relatively simple, and thus can be 
extended to more complicated scenarios with little effort, as is evident in the fault-tolerance case. 

Our main technical contribution is to design new rounding procedures which use few edges but 
are effective in creating many suitable paths. Two very natural and well-known rounding techniques 
fail miserably: (1) rounding separately each edge proportional to its LP value (deterministically 
or randomly) is unlikely to form suitable paths, and (2) rounding separately each flow-path (say 



randomly a la Raghavan and Thompson | RT87 Q will use far too many edges. The challenge is thus 
to "coordinate" the selection of edges so that they tend to create suitable paths. Put differently, 
each (u, v ) 6 E can be seen as a demand pair with its own flow, and we need to select one flow-path 
for each pair in a way that is "biased" towards using the same edges. Ideally, we would like to 
select both the edges and the flow-paths proportionally to their LP value. 
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Our first algorithm, for general k, is based on classifying demand pairs according to whether 
they have "few" or "many" low-stretch paths. The key insight is to make this classification rely on 
counting vertices participating in low-stretch paths. The algorithm is then almost straightforward: 
applying a threshold rounding of the LP handles pairs of the first type, and building shortest-path 
arborescences from a small number of randomly chosen vertices handles pairs of the second type. 
This algorithm is described in Section ||. To extend this algorithm to the fault-tolerant case we use 
the appropriate LP relaxation and apply the above rounding technique to several "perturbations" 
of the instance, each obtained by deleting from the graph a random subset of vertices/edges. This 



algorithm is described in Section |3.1| . We believe that this perturbation technique, which we call 
failure sampling, is of independent interest, and may find future applications in related fault- 
tolerant problems. For example, a simple application of failure sampling combined with existing 
undirected spanner constructions gives a fault-tolerant undirected spanner construction with an 
absolute bound on its size that is only 0(r 3 ) larger than for a non-fault-tolerant spanner (where r 
is the number of faults), while the best previous construction |CLPR0S | has size 0(k r ) larger than 
a non- fault-tolerant spanner. 

Our second algorithm, for directed 3-spanner with unit-length edges, uses randomized rounding 
but at the level of vertices rather than edges or flow-paths. For every v £ V we choose a random 
threshold T v 6 [0,1], and include in the solution every edge (u,v) £ E for which, compared to 
the edge's LP value, either mm{T u ,T v } is "small" or max{T u ,T v } is "moderate". The probability 
of including an edge in this solution is proportional to the edge's LP value, but our conditions 
encourage positive correlation along a path (e.g. for edges sharing an endpoint). The proof boils 
down to considering a given demand pair, and analyzing the possibly many different flow-paths 
between them. In some cases, we control the correlation between these flow-paths using Janson's 
inequality (see e.g. [ AS00| , DP09| ]). But in other cases such correlation analysis is not effective, 



so we structurally "decompose" the paths into their first, second and third hops and then use 
standard concentration bounds separately for each hop plus some global arguments based on flow 
conservation. This rounding procedure extends to the fault-tolerant case very easily; we just repeat 
the rounding procedure several times with fresh coins. These algorithms are described in Section |||. 

1.4 Concluding Remarks and Future Work 

Our results for directed /c-spanner with unit edge-lengths address what Elkin and Peleg [EP05] 



highlighted as two "challenging directions": obtaining sublinear approximation for general k and 
improving over their (5(n 2//3 )-approximation for k = 3. LP-based approaches are quite generic yet 
often times optimal, and thus it would not be surprising if the integrality gap of our LP relaxation 
gives away the problem's true approximability threshold, which appears to be polynomial with 
exponent in the range [1/3,2/3] (possibly just 0{^fn)) for all k > 3. 

A bolder conjecture would be that this LP relaxation also exposes the approximability thresh- 
old for other spanner problems. One such family of problems is the undirected setting, whose 
consideration we defer to future work but briefly discuss some preliminary results. For k = 3 the 
known approximation (and thus integrality gap) is 0(yjn) by the absolute guarantee of [ ADD + 93] ] 



(or our Theorem ^^), while our best integrality is ^(n 1 ' 8 ), which we prove by a somewhat tricky 
analysis of the random graph G n ^ v for p ~ 1/^/n. For larger k, the known approximation (and 
thus integrality gap) is 0(n 2 ^ k+1 ^) via | ADD + 93 ], while our best integrality gap is using 



the reduction designed by |EP07[ from Min-Rep with large girth (which they conjecture to be NP- 
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hard). Compared to [EP07], our integrality gap may be viewed as yet another indication to the 
inapproximability of basic fc-spanners, but with a factor much closer to the known approximation 
algorithm. 



2 Flow-Based LP Relaxation 



We begin by describing the linear programming relaxation of the directed fc-spanner problem that 
we will use. Consider an instance of the directed fc-spanner problem: a directed graph G = (V, E) 
and an assignment of lengths to the edges d : E — >• M + . For (u, v) G E, let V u ,v denote the set of all 
stretch k paths (in G) from u to v, i.e. valid paths whose length is within factor k of the shortest. 
It is easy to see that the following LP is a relaxation of the /c-spanner problem. The variables are 
x e , representing whether edge e G E is included in G' , and fp, representing flow along path P. 



min ^2 x e 












s.t. ^ fp <x e 


V(u,v) G E, 


Ve G E 


PeVu.v-.eeP 








V(u,v) G E 










X e >0 


Ve G P 




fp>0 


V(u,v) G E, 


VP G V u , v 



(2) 



In general LP (|2|) has exponential size. If the number of paths in V u ,v is & t most polynomial for 
all (u, v) G E (for example if all lengths are unit and k is a constant) then the LP obviously has 
only a polynomial number of variables and constraints, so can be solved optimally. But in general 
we will need a different approach. We will work with the dual, which has a polynomial number of 
variables and an exponential number of constraints. So it is sufficient to find a separation oracle 
for the dual. It turns out that the separation problem for the dual is the Restricted Shortest Path 
problem, sometimes also called the Length Constrained Lightest Path problem. A PTAS is known 



for this problem [ LR01 , Has92 |, so we can approximately separate for the dual. This is enough to 
approximately solve the primal. 

Theorem 2.1. There is an algorithm that in polynomial time computes a (1 + e) approximation 
to the optimal solution of LP (0) for any constant e > 0. 

Proof. The dual of LP (|J) is LP (||), which has a variable for every edge and a variable for every 
pair of edges. The intuition is that the y e a v variables must form a "fractional cut" of (u, v) relative 
to stretch-/c paths. 



max ^2 z u> „ 












s-t. Yl y tv - 1 


Ve G E 




(u,v)eE 








\/(u,v) G E, 


VP G V u ,v 


eeP 








V(u,v) G E 




y e u ,v > o 


V(u,v) G E, 


Ve G E 



(3) 
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To construct a separation oracle for this LP, note there are only a polynomial number (\E\) of 
constraints of the first type, so we can just check them one by one. For constraints of the second 
type, note that for every u, v E V the values {y e u v } e eE are just a non-negative edge- weighting, and 
the constraint just require all of the original stretch k paths to have total length (under this new 
weighting) of at least z UtV . So we get the following problem: given two weightings of the same 
graph, find the shortest path under the second weighting subject to having length at most T > 
under the first weighting (for some threshold T) . If we could solve this we would have a separation 
oracle for the dual. Note that in the unit-length case stretch-/c paths correspond exactly to fc-hop 
paths, so we can solve this problem exactly using Bellman-Ford. 

For the general lengths setting, this problem has been considered in the literature under the 
names "Length Constrained Lightest Path" and "Restricted Shortest Path" . An FPTAS is known 
to exist ULROlj , |Has92( |, which gives us an approximate separation oracle. So by using Ellipsoid with 
this oracle we find a polynomial number of constraints such that the optimal solution violates all of 
the other constraints (which we did not include) by at most 1 — e, i.e. there might be paths where 
(1 — e)z u>v < XleeP Vuv So if we simply let z' u v = (1 — e)z UjV we have a feasible solution for LP (|3|) 
that is within 1 — e of optimal. So the optimum of LP @ is at least 1 — e times the optimum of the 
compact dual (informally, changing to only a polynomial number of constraints did not affect the 
value of the optimal solution very much) . Thus by strong duality if we use solve a compact version 
of LP (||) that has only the variables corresponding to the constraints found by Ellipsoid on the 
dual (of which there are only a polynomial number) we get a solution of value equal to the optimum 
of the compact dual, which is at most 1/(1 — e) times the value of the actual dual (LP □ 



2.1 Fault Tolerant Relaxation 

There are two versions of the r-fault tolerant /c-spanner problem, depending on whether we protect 
against edge faults or vertex faults. The idea in both cases is the same, so we shall focus on vertex 
faults: construct a subgraph H of G such that for every set F of at most r faulting vertices, H\F 
is a A;-spanner of G \ F. We can change LP (|2|) to support this version by allowing a different set 
of flows {fp } for every such F, but using the same capacity variables {x e }. More formally, let V^ v 
be the set of stretch-A; paths from u to v in G \ F (where F is a set of edges for the edge tolerant 
version or is a set of vertices for the vertex tolerant version). For F C V, let Ep C E be the set of 
edges with at least one endpoint in F. We will use the following relaxation for the vertex version 
(the edge version is analogous): 



min x e 




























S.t. ^ fp <X e 


VF C V : 


\F\ 


< r, 


V(n, 


v)€E\ Ep, 


Ve G F \ Ep 


Pev£ v : eeP 
















VF C V : 


\F\ 


< r, 


V(n, 


v) E E \ Ep 


















X e > 


Ve 6 E 












fp >o 


VF C V : 


\F\ 


< r, 


V(n, 


v)EE\ E f , 


VF E V F 

v ^ ' u,v 



(4) 



It is easy to see that LP (Q) has constraints. Each possible fault set acts like an instance 

of the original spanner LP (g), except for the sharing of the capacity variables {x e } e& p. So when 
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we take the dual we get a program with variables, and if r is constant the separation oracle 

we designed for the non-fault-tolerant version suffices to separate this LP as well. When r is not 
constant this technique does not work as the dual will have a superpolynomial number of variables. 
Instead we will give a bicriteria algorithm, which in the unit-length case will find a fractional 
assignment to the x e variables of cost at most — ^ times larger than the cost of the best fractional 
solution, and that supports flows that satisfy the constraints for all F of size at most r/((l + e)k). 

Theorem 2.2. For any e > there is a polynomial time algorithm that, given an instance of the 
unit-length directed r -fault-tolerant k-spanner problem, finds a set of fractional capacities {x e } e ^E 
with the following two properties: 1 ) there exist flow variables that satisfy the flow and capacity 
constraints of LP (Q) for all fault sets of size at most jt^j. , end 2) Yl,e&E x & * s °^ mos ^ ~p ^ mes 
larger than the optimal solution to LP (0) 

Proof. When r is super-constant there is a super-polynomial number of constraints in the LP, so 
we cannot solve it using earlier methods (when we transform to the dual we get a super-polynomial 
number of variables). Instead of going through the dual we will stick with the primal and give 
a separation oracle. However, since ellipsoid with a separation oracle takes time polynomial in 
the dimension (i.e. the number of variables) we need to transform the problem into one with a 
polynomial number of variables. We do this in a simple way: we simply project the polytope down 
on the capacity variables x e , of which there are only 0(m). The objective function of LP (Q) uses 
only the x e variables, so optimizing over this projection is sufficient to optimize over the full LP. 
And since this is a projection of a convex set it is itself convex, so if we can design a separation 
oracle the ellipsoid algorithm will run in polynomial time. 

So what would a separation oracle for this projected polytope be? Simply examining LP (Q) 
shows that a setting of the capacity variables {x e } e< zE is not a valid solution if and only if there is 
some set of at most r faults such that it is impossible to send 1 unit of flow between all demands. 
Slightly more formally, {x e } e ^E is not a valid solution if and only if there is some fault set F (of 
size at most r) and edge (u,v) £ E such that: 

1. (u,v) F for the edge-fault case, or u, v F for the vertex-fault case, and 

2. There is no way of sending one unit of flow along stretch-A; paths from u to v in G \ F while 
respecting capacities {x e }. 

By strong duality, the maximum flow that can be sent along stretch-/c u — v paths is equal to 
the smallest fractional cut, where a fractional cut is an assignment of values y e to the edges such 
that X^eGP Ve — 1 f° r an P ^ 'Puw The size of such a cut for a particular fault set F is ^ee£ x eUe- 
So for every fault set F of size at most r, for every remaining demand (u,v), for every fractional 
cut {y e } e £E relative to F and to (it, v), any feasible solution {x e } e£ E has X^eeE Ve x e > 1- These are 
the violated constraints that our separation oracle will find (or at least will approximately find). 

So to construct a separation oracle, we want to find a set of faults F and demand (it, v) with 
the smallest fractional cut. If the size of this cut is less than 1, then we have found a separating 
hyperplane, and if there is no such set F then the current capacities are feasible. In order to solve 
this problem, which we will call Stretch-A; Interdiction, we first write it as a mixed-integer 
program. Since there are only a polynomial number of (it, v) demands we can simply try them all, 
so our formulation is for some given (u,v). This formulation is for the vertex-fault version; the 
edge-fault version follows the same basic idea. Recall that the x e 's are the capacity variables in the 
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original problem, so in this context they are fixed constants and thus the objective function and 
the constraints are linear. The intention of MIP (||) is for z w to represent whether vertex w is part 
of the fault set and for the {y e } variables to represent a fractional cut of the remaining paths. 



min ^ x e y e 




* 1 1 

s-t. 2^ ( y ( a > b ) + 2 Za + 2 Zb ' ~ 




VP E V u ,v 


(a,b)eP 




y~] z w <r 




wev 




z u z v 




z w E {0, 1} 


\/w G V 


Ve >0 


Ve G P 



(5) 



Claim 2.3. MIP (§) is an exact formulation of Stretch-A; Interdiction 

Proof. Note that there is a one-to-one correspondence between settings of the z w variables and 
possible fault sets. For every setting of the z w 's, the objective value is the minimum cost fractional 
cut (where we have to cut stretch- A; paths that do not hit any faults), which is exactly what we are 
trying to optimize. □ 

Lemma 2.4. There is a bicriteria approximation for Stretch-A; Interdiction that uses at most 
(1 + e)kr faults (instead of r) and has cost at most times the best r-fault solution 

Proof. In order to solve MIP (S) we relax the integrality constraints on the z w variables to < z w < 
1, giving us a linear program. We can solve the resulting LP by constructing its own separation 
oracle: if we define the length of an edge (a, b) to be yufy + \z a + \z^ is there a stretch-fc path with 
length less than 1? Since we only consider the fault-tolerant setting for the unit-length case we can 
actually solve this problem exactly using Bellman-Ford (since in this case stretch-A; is equivalent to 
A;- hop). So we can solve this LP in polynomial time. 

Now we need to round the z w variables to integers. We will use a very simple threshold rounding: 
if z w > m^pb then set z' w = 1; otherwise set z' w = 0. Furthermore, set y' e = ^-^y e - Since any stretch- 
k path is a fc-hop path, if YlweP Zw — then z w > f° r some w G V, and thus z' w covers 

P. On the other hand, if Y,weP z v> < TT^ then ^e&pVe > so EeeP^e = ^EeeP^ ^ 1 



and the y' variables cover P. Thus 



Ay') 



is a valid solution to the MIP except that ^ 



(1 + e)kr instead of being at most r. In other words, we have designed a ((1 + e)k, — t 
approximation algorithm for MIP (JSh and thus for Stretch-A; Interdiction. 



wev z w — 
)-bicriteria 

□ 



By using this bicriteria approximation with original fault budget r/(l + e)k instead of r, we 
will find a separating hyperplane (whose coefficients are the {y' e } variables) as long as there is some 
fault set F of size at most r/(l + e)k and demand (u, v) for which the maximum stretch-A; flow (or 
equivalently the minimum fractional cut) is at most e/(l + e). So using this separation oracle with 
the Ellipsoid algorithm and then rounding the capacities we find up by gives us a bicriteria 
algorithm for LP (0), yielding the theorem. □ 
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3 Approximations for Directed /c-Spanner 



We will now design a 0(n 2 ' 3 )-approximation algorithm for the directed fc-spanner problem. We 



first solve LP (|2|) as detailed in Theorem 2.1 to get a fractional solution (x, /). We then round this 
solution using Algorithm [l], which has two main components: a simple threshold rounding scheme 
together with a collection of shortest path arborescences. 

Algorithm 1: Rounding Algorithm for Directed fc-spanner 

1 E' <- {e E E : x e > l/(3nlnn) 2 / 3 } 

2 for i <— 1 to (3nlnn) 2 / 3 do 



3 
4 
5 



Choose v E V uniformly at random 

T\ n <— shortest path in-arborescence rooted at v 

T° ut <— shortest path out-arborescence rooted at v 



6 Output E' U ( u { ^ nn? '\Ti n U T { 



To show that this algorithm gives a valid £;-spanner, we begin with a lemma that characterizes 
edges that are satisfied by the thresholding. For every (u, v) in E, let N uv C V be the set of 
vertices that lie on a path of stretch at most k from u to v (i.e. the set of vertices that are used by 
at least one path in V u ,v)- 

Lemma 3.1. For any (u,v) E E there is a path P E Vu,v with the property that every edge e E P 
has Xe - \Nu, v \ 2 

Proof. Suppose this is false for some (u,v). Let B C N u>v x N u<v be the set of edges with x e < 
1/lAu^l 2 . Then every path P E V u ,v goes through at least one edge in B, so these edges form a cut 
between u and v relative to the paths in V u ,v Since we have a valid LP solution, we know that at 
least one unit of flow is sent from u to v using paths in V u ,v This means that the number of edges 
in B must be at least |A^ Uj „| 2 . But this is a contradiction: every edge in B has both endpoints in 
N UjV , so there are at most ('^2'"') < l-^w^l 2 °f them. □ 

So if l-ZVu^l is small, Lemma |3.1| implies that there is some stretch k path with the property 
that every edge is assigned a large capacity. On the other hand, if \N U)V \ is large then there are 
many nodes that are on stretch k paths, so we should be able to find such a path by picking nodes 
randomly. This is formalized in the following lemma: 

Lemma 3.2. If we sample at least |^ ln " vertices independently and uniformly at random, then 
with probability at least 1 — 1/n 3 at least one sampled vertex will be in N uv 

3n In n 

Proof. The probability that no sampled vertex is in N uv is at most ( 1 - ^ ) |JV ^' < e - 31n ™ = 
1/n 3 and thus the probability that at least one sampled vertex is in N uv is at least 1 — 1/n 3 □ 

Theorem 3.3. There is a polynomial time algorithm that with high probability returns a directed 
k-spanner of size at most (^((nlnn) 2 / 3 ) times the smallest directed k-spanner, for any k > 1. 
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Proof. The algorithm is simply to solve LP (|2|) using Theorem 2.1, and then round the solution 
using Algorithm |l|. We first prove that it results in a valid spanner with high probability. Consider 
some edge (u,v) G E. If |iV U|1J | < (3nlnra) 1//3 , then Lemma |3.1| implies that there is some stretch 
k path from u to v using edges contained in E' and thus in the spanner. On the other hand, if 
|-Wu,i;| > (3nlnn) 1//3 then Lemma 3^ implies that with probability at least 1 — 1/n 3 we will have 



sampled some vertex in N UiV . Suppose we sample w £ N uv on the ith. iteration. By the definition 
ofN u . v we know that w is on some path from u to v with stretch at most k, and thus the length of 
the shortest path from utoro plus the length of the shortest path from w to v is at most k- dc(u,v). 
These paths are contained in T\ n U T° ut , so the spanner will include both of these shortest paths 
and thus will include a path from u to v with stretch at most k. Taking a union bound over all 
(u, v) completes the proof that the returned subgraph is a /c-spanner. 

To prove that it is a 0((n In n) 2//3 )-approximation we will show that each of the two steps costs 
at most 0((ralnn) 2 / 3 ) x OPT. This is obvious for the LP rounding step: an edge e is in E' only if 
x e > l/(3ralnra) 2 / 3 , so \E'\ is at most 0((nlnn) 2 / 3 ) times the LP cost. To show that the second 
step does not add many edges, note that every iteration adds at most 2(n — 1) edges, and that 
n — 1 is a trivial lower bound on OPT (since we are assuming the underlying graph is connected; 
if it is not connected then it is easy to modify this analysis to still hold). Thus the total cost of all 
the arborescences is at most 2(3nlnn) 2 ' 3 x OPT □ 

3.1 Extension to r- Fault-Tolerant Version 

In order to adapt the rounding scheme of Algorithm [l] to the fault-tolerant case we need to show 
how to modify the threshold rounding and the arborescence sampling. It is simple to see that 
Lemma [O] still holds for every fault set, so the threshold rounding will still work (although we will 
change the threshold). But the arborescence rounding must be changed to allow for faults. The 
intuition behind the change comes from the technique of color-coding [ AYZ95| : before randomly 



sampling the root of an arborescence, independently fail each element (either edges in the edge- 
tolerant version or vertices in the vertex-tolerant version) with some probability p. We call this 
technique failure sampling. We then randomly sample a root and include its shortest-path in- and 
out-arborescences in the resulting subgraph. By setting p, the number of arborescences sampled, 
and the threshold of the rounding appropriately, we get the following theorem. We say that an 
algorithm is an (a, /^-approximation for the r-fault-tolerant directed fc-spanner problem if it returns 
a ax-fault-tolerant /c-spanner of size at most (3 times the smallest A:-spanner, and an algorithm is a 
true /3-approximation if it is a (1, /3)-approximation. 

Theorem 3.4. For any constant e > there is a polynomial-time algorithm that is a 
(i+e)fc ' ^ ( ( ^ 1+£ ^tr^fc fc — " lnn ) j) -approximation for the unit-length r-fault-tolerant directed 

k-spanner problem. There is also a true approximation algorithm with the same approximation 
ratio that takes time. 

Proof. Suppose that we have a feasible solution for LP (Q) (or at least an approximate solution 
from Theorem |2.2| ). We show how to round a fractional solution into an integer solution, assuming 
that all lengths are 1. Our rounding is basically the same as for the non- fault tolerant version. 
In particular, Lemma [O] still holds for every fault set. So we can, as before, set a threshold 
value t and round up any edge with x e > 1/t. The only difference comes in the random sampling 
step: in the non- fault-tolerant version, it sufficed to randomly pick centers of shortest path in- 



10 



and out-arborescences. But in the fault-tolerant setting that is no longer sufficient; the paths we 
construct must suffice even after failures, which simple shortest paths obviously will not. So we 
will add an extra step inspired by color-coding \ AYZ95| ] : before randomly sampling the root of an 



arborescence, independently fail each element (either edges in the edge-tolerant version or vertices 
in the vertex-tolerant version) with some probability p. We call this technique failure sampling. We 
then randomly sample the root of shortest-path in- and out-arborescences in the resulting subgraph. 

Consider some fault set F and some edge (u,v) that still survives in G \ F. Define Np(u,v) 
as in the no-fault setting: a vertex x is in Np(u, v) if x is on some stretch-A; u — v path in G \ F. 
Since Lemma [O] still holds, if \N(u,v)\ < \ft then the threshold rounding satisfies the demand. 
So we assume that \Nf(u,v)\ > \ft and analyze the probability that a single round of the random 
sampling will satisfy the demand. We will then perform the number of rounds necessary to be able 
to take a union bound over all possible F and (u, i>). 

Since we are assuming all edge lengths are 1, a stretch- A; path is the same as a /c-hop path. 
A sufficient condition for the sampling to succeed for F and (u, v) is for the arborescence root 
to be a vertex in Np(u,v), everything in F to be killed by the failure sampling, and nothing 
from the A>hop path containing the root to be killed (note that such a path must exist by the 
definition of Np(u,v)). Given that the the root is selected to be in Np(u,v), the probability that 
the particular A-hop u — v path containing the root is all preserved by the failure sampling is 
(1 — p) k . And clearly the probability that everything from F is killed by the failure sampling is 
p k , and is independent of the other two events. So the probability that all three events happen is 
p r x l^HHi^U x (1 — p) k > p r (l — p) k — . The number of possible failure sets F and demands (u, v) 
is at most (™) x Q) < n 2r+2 in the edge-failure setting; for vertex failures it is at most n r+2 . Let 
t be the number of rounds for which we repeat the random sampling. Then in order to succeed on 
all constraints with probability at least 1/2, we want 

1 f{\-p) k Vi V < i 



n J 2n 2r+2 
Setting p = and solving for £, we get that it is sufficient to set 

_ 2(2r + 2)(k + r) k+r n\nn 
r r k k y/t 

As in the non-fault-tolerant case, we balance out the cost of the sampling (£) with the cost of 
the threshold rounding (t) to get a total approximation of 



O 



r(k + r) k+r n In n \ 2 ^ 3 



r 



■k k J 



If r is constant, then this rounding combined with our ability to actually solve LP (4j) gives us 



the theorem for the r = 0(1) case. If r is not constant then we need to use Theorem 2fl before the 



rounding procedure, giving us the claimed bicriteria approximation. □ 
3.2 Integrality Gap 

We now complement our approximation algorithm by proving a nearly matching integrality gap. 
We do this by a reduction from the Min-Rep problem. In Min-Rep we are given a bipartite graph 
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G = (U,V,E) together with a partition of U and V into groups Ui, U2 ■ ■ ■ , U p and Vi, V2, ■ ■ ■ V p . We 
say that there is a superedge between two groups (U,Vj) if there is some u G U and some v G Vj 
such that {u, v} G E. The goal is to find a subset X C U U V of as few vertices as possible such 
that for every pair of groups (U, Vj) with a superedge there is some u £ Ud X and v £ Vj H X 
such that {u, v} G -E. 

Elkin and Peleg [EP07| proved hardness for directed fe-spanner by using a reduction from Min- 
Rep, and we will use their reduction to prove an integrality gap. But instead of reducing from 
generic Min-Rep instances as in a hardness proof, we will only apply the reduction to instances 
of Min-Rep in which every superedge actually corresponds to a matching between vertices, i.e. if 
(Ui, Vj) is a superedge then there is a matching between Ui and Vj. The interested reader might 



note that these are basically instances of the Unique Games Problem [Kho02]. 



We first give a lemma that was proved implicitly by Charikar, Hajiaghayi, and Karloff [CHK09]: 



Lemma 3.5. For any constant e > 0, there are instances of Min-Rep with the following properties: 

2 _ 

1. Every group has size n'-i e 

2. OPT > nl" e 



3. There is a matching between every Ui and Vj 

We now use this lemma to prove the main integrality gap theorem by applying the Elkin and 



Peleg reduction [EP07] to instances from Lemma 3J3 and showing there it has a small fractional 
solution. 

Theorem 3.6. The integrality gap of LP @ on the unit-length directed k-spanner problem is 
Q(^n3^ e ) for any constant e > 0. 

Proof. The instances that we use to prove this integrality gap are the instances we obtain by 



applying the reduction of Elkin and Peleg [EP07] to the instances from Lemma 3J5. We explain 
the reduction in detail so as to analyze the best fractional LP solution. Let r be the number of 
groups, so in the instances from Lemma 3.5 we have that r = n3 +t . For each group we will add 

2 

x = ii3" £ /((fc- l)/2) paths, where each path has length (k — l)/2 (for ease of exposition we assume 
that k is odd, but it does not actually matter). More formally, let (U',V',E') be a Min-Rep 
instance from Lemma 3.5, and let umr = \U' U V'\. Then the vertex set of our spanner problem is 



v=(u'u V) |J 



u 



p=l 



u 



r/2 ,(fc-l)/2 D p 
=1 U j=l 



x r/2 ,(fc-l)/2 ,p 
u p=l u i=l u j=l T i,j 



n M R+((n 3 MR 



/((*-i)/2))-4;- 



((k-l)/2) =2n MR , 



Note that n = \ V\ = n M R+x-r- (k- 1)/2 
so we have only doubled the number of vertices. 

The edge set is divided into a few different components. First, let E" = {(u,v) : u G U',v G 
V',{u,v} G E'} be the original Min-Rep edges but now directed from U to V. Next we add a 
clique to every group: let Eq = U^((L r j x Ui) U (Vi x Vi)). We also want to turn the new vertices 
into paths: let 



x r/2 (fc-3)/2 

I'm U U U {(«&>*&+iM*y.*&+i)} 

p=li=l j=l 
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so for every fixed i and p, the Em edges form a directed path from to / 2 , and similarly 

for the t vertices. We also add edges to connect these paths to the original vertices: let 

x r/2 

E U = U U ({(<(ft-l)/2' u ) : ^t/i}U {(M^) : f G Vi} 
p=i i=i 

Finally, we want to add edges to connect the endpoints of paths corresponding to superedges 
(which in our case is all (£/$, Vj): let 

p=l j=l J=l 

So our final edge set is E = E" U E c U #m UEuUEj. 



Elkin and Peleg [EP07| showed that the optimal spanner has size at least fi(ix OPTmr), where 
OPTmr is the size of the smallest Min-Rep solution. So in our case, the best spanner has size at 
least 

O | ^MR x J A = n ( 1 |-2. 
(fc-l)/2 A/R J V fc 

On the other hand, we claim that the best fractional solution is small, namely 0(n). To see 
this, consider the following fractional assignment. All edges inside the tails, i.e. all edges in Em, 

2 

have fractional capacity 1. Let q = umr/t = n»~ e be the size of each group. We set the fractional 
capacity of all edges in E" U Ec U Ejj to 2/q, and set the fractional capacity of all edge in Ej to 0. 
The cost of this solution is 



\E M \ + - {\E"\ + \E C \ + \E V \) < xr k -^ + l(^A + q \ + xrq ^ 



k — 3 2r 2 

< xr 1 h 2qr + 2xr 

2 4 



< n M R +n MR + 2n M R + 2n M R 
= 0{n) 

So it remains to prove that it is a valid fractional solution. We proceed by analyzing each type 
of edge. Obvious since every edge in Em is included with capacity 1, we are able to send one unit 
of flow. For some edge (u, v) G E" , we can send q — 1 flows, each of size 1/q, first to the vertices in 
the same group as u (via the Ec edges), then across the matching to the group containing v (via 
E" edges), and then back to v (via Ec edges). We can send the final 1/q flow directly on the edge 
(u,v). These paths have length 3 < k and obviously satisfy capacity constraints. For some edge 
(a, b) G Ec, we can do the same thing without crossing any matching: send l/(q — 1) < 2/q flow to 
each of the group mates of a, and then back into b. For an edge (sP (i e -i)/2> n ) ^ we can sen d 
1/q flow from Sj i (fc_i)/2 to each of the vertices in the group containing u (using E\j edges) and the 
from those vertices to u (using Ec edges). Similarly, for an edge (v, -J we can send l/(q — T) < 2/q 
flow to each of the other vertices in V%, and then from these vertices to . Finally, for an edge 
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( s ?i'^(fc-i)/2) e ^ I we can sen d one um t along the s? path, then split it into q paths using the 
Ejj edges, send each of those 1/q flows across the matching using E" edges, recombine the flow at 
t p - x using Ejj edges, and then send it down the path to t p - r k _iy 2 - Each of these q paths has length 
k and satisfies the fractional capacities. Thus this is a valid fractional solution. □ 

The reason that we lose an extra \ in the integrality gap is that because we are in the unit- 
length case we need to add many vertices in order to build long paths. It is easy to modify the 



proof of Theorem 3.6 to give a gap of Q(n3 e ) in the arbitrary lengths setting. 



4 Directed Unit-Length 3-Spanner 

While our 0(n 2/,3 )-approximation is an improvement over previous work for arbitrary edge lengths 
and for unit edge lengths with k > 3, for unit edge lengths with k = 3 it matches the previous 
bounds of Elkin and Peleg [EP05| and Bhattacharyya et al. [BGJ + 0£]. So for the specific case 



of unit-length directed 3-spanner we develop a different rounding algorithm for the flow-based 
LP (|2|) that gives an 0(-y/n)-approximation. Our algorithm first solves LP (|J) and then rounds it 
using Algorithm |2[ Informally, this rounding works by choosing a threshold value T v G [0, 1] for 
each vertex v G V. We then add all edges (u,v) where either T u or T v is at most px U;V , where 
p = 0(y / nlogn) is an inflation factor to make the probabilities large enough. This turns out to not 
be quite enough edges, so we also add all edges (u, v) where both T u and T v are at most ^Jpx UtV . For 
technical reasons we have to add an extra complication: every vertex will actually choose another 
threshold, T„, and edges are added as described for every combination of T and T 1 thresholds. 

Algorithm 2: Rounding Algorithm for 3-spanner 

1 Set p = C \fn log n for a large constant C 

2 For every v G V choose independently two values T v , T' v G_r [0, 1] 

3 Let E 1 = {(«,«) G E : mm{T u ,T^T v ,T^} < p ■ x u , v } 

4 Let E 2 = {(u,v) G E : max{min{T n , T' u ], min{T„, T' v }} < ^p ■ x UtV 

5 Output E' = Ei U E 2 



Lemma 4.1. Algorithm || returns a set of edges E' with E[|-E'|] < O(p) times the size of the 
smallest 3-spanner. 

Proof. Let (n, v) G E. Obviously Pr[T u < px U:V ] < px U:V , and similarly for T' U ,T V , and T' v . So 
Pr[(n, v) G E\] < Apx UyV by a simple union bound. To analyze E 2 , note that the probability that 
(n, v) G E 2 is equal to the probability that min{T u ,T^} < ^/px UjV and min{T^,T^} < ^/px UjV . 
Since these are independent, the probability that the both happen is equal to the product of their 
probabilities. And by another union bound we get that each of the probabilities is at most 2^/px W; ^, 
and thus the probability that (u,v) G E 2 is at most Apx u ^ v . Thus the probability that (n, v) G E' 
is at most 8px UtV , so by linearity of expectations the expected number of edges in E' is at most 
(u,v)eE x u,v 5 which is exactly 8p times the cost of the LP solution and thus at most 8p times 
the size of the smallest 3-spanner. □ 

Lemma 4.2. For every edge (u,v) G E, the probability that there is no path of length at most 3 
from u to v in E' is at most 1/e 
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Proof. We know that in the LP solution a flow of 1 unit was sent from u to v along paths of length 
at most 3. If at least 1/3 flow was sent from u to v through paths of length 1 (i.e. through (u, v)), 
then x u v > 1/3 so px UjV > 1, and thus (u,v) G E\ with probability 1. 

Alternatively, suppose at least 1/3 of the flow is sent through paths of length 2. Let wi,W2, ■ ■ ■ ,we 
be the midpoints of these paths (note that all Wi are distinct). For each i G [£], let /j = 
mm{x UtWi , x Wi V }. For a length 2 path to be in E' , it is sufficient for one of these Wi to have 
T Wi < pfi (since that would mean both (u,Wi) and (wi,v) would be in E±). If /j > 1/p for some i, 
then this happens with probability 1. If fi < 1/p, then the probability that none of these events 
occur is rii=i(l ~ Pfi) ^ er p ^ f*. Since 1/3 flow was sent along these paths, J2i /i > 1/3, and thus 
the probability that E\ does not contain length two u — v path is at most e _p//3 , which is clearly 
small enough to satisfy the lemma. 

The most difficult case is when at least 1/3 of the flow is sent through paths of length 3. For 
each such path P, let p\ denote the first vertex after u and let p2 denote the second vertex after 
u, so P = u — > pi — > p2 — > v . For every edge (w,y), let x WyV be the amount of u — > v flow using 
paths of length 3 that use (w,y). Clearly x WjV < x w>y , so it is sufficient to show that the rounding 
algorithm works when using the x values instead of the x values. We now divide this case into five 
subcases. 

Case 0: We first consider the case that there is some edge ( u,a) with x u a ^ 1/p- Then (u, o) is 
in Ei with probability 1, so it is definitely included in the spanner, and at least 1/p flow actually 
flows through (u, a). So this is essentially like the case of paths of length 2, just with flow of 1/p 
instead of 1/3. Let P a denote the set of length 3 paths that begin with the edge (u, a) (i.e. the 
set of paths where pi = a). If there is some path P £ P a with x atP2 > 1/p, then clearly by flow 
conservation x P2i „ > 1/p, so with probability 1 all of P is in E\. Otherwise, the probability that 
such a P is not contained in Ei is at most 1 — px a ^ P2 . Thus the probability that we get no P G P a 
is at most Yl PePa (l - px a , Pl ) < e~ p ^ p t p * Xa ' P2 = e ~ p ^e^/( p ) < l/ e , satisfying the lemma. The 
same argument can be made for the case that there is some edge (b,v) with xj, jV > 1/p, and if 
some other edge (w, y) has x WiV > 1/p then by flow conservation there must be some e = (u,a) or 
e = (b, v) with x e > 1/p. So for the rest of the argument we assume without loss of generality that 
x e < 1/p for all e G E. 

Cases 1-4: We now divide the length 3 paths into four types: 

1- \J P%pi,P2 — P&u,pi and px Plt p 2 < pXp 2 ^ v 

2- px u ,px — \J P%pi,P2 an d pXp 2:V < yj px Pl: p 2 

3. px u ,pi ^ yj P%pi,p2 — P%P2,v 

4. pXp 2jV < yj pXp 1: p 2 ^ P%U,pi 

These types are exhaustive, so since at least 1/3 flow uses these paths at least one of the types 
contains paths that correspond to at least 1/12 units of flow. Let Vi denote the paths of type i. 
We now consider each of the four types in turn. 
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Case 1: Suppose that at least 1/12 units of flow use paths from V\. For every edge (w, y) 
appearing in some path from V\, let x' w y be the actual amount of V\ flow that uses (w,y). Then 
x' wy — x w,y < %w,y for all (w,y). For any path P e Vi, a sufficient condition for P to appear 
in E' is for T pi < y ' P x 'p ltP2 an d for T p2 < \/px' pitP2 , since if this happens then (u,pi) £ E\, 
(Pi,P2) S E2, and (^2,^) £ ^l- For each P, let Ep be the event that this happens. Then 
Pr[Ep] = \fpx' pi ~ \/px' pi p2 = px' pi p2 . Let Y = J^PeVi Ep be the number of type 1 paths that are 
in E because of this sufficient condition. It is sufficient to show that the probability that Y > is at 
least a constant, since the repetition of the algorithm Clogn times makes this probability become 
at least 1 — 1/n 3 . To bound the probability that Y > we will use Janson's inequality | DP09| . 



Chapter 3]. Janson's inequality has two parameters: the expectation K[Y] and a value A which 
intuitively measures the amount of dependency. Informally, A is the sum over (ordered) pairs of 
dependent events of the probability that they both occur. In our setting, two events Ep and Ep/ 
are dependent if P and P share either a first edge or a last edge (if they share a middle edge 

then they are obviously the same path). Let P ~ P' if Ep and Ep/ are dependent. Then for us 

E[y] 2 

A = EpePi Ep'ePjiP'-piM^P A E P'\- Janson's inequality implies that Pr[Y = 0] < e WFa. 

Every length 3 path has a different middle edge, so the fact that at least 1/12 units of flow use 
paths in V\ implies that E[Y] = J2p£-p 1 P x p!,p2 — P/^- To bound A, we first consider the case the 
two paths share the first edge, i.e. we will try to bound Ai = J2aeV J2peP a Ylp'^PeP a Pi'[Ep AEpr], 
where P a is the set of paths in Pi that begin with the edge (n, a). Let P, P' £ P a be two such paths. 
In order for both Ep and Ep> to occur, it is necessary and sufficient for the following four conditions 
to hold: (1) T a < \J px' ap2 , (2) T a < J px' j , (3) T p2 < yfpx' ap2} and (4) T p < % < Jpx' Obviously 



the probability of this happening is exactly px' a p2 ■ ^Jpx' apl ■ min{y / px / a)P2 , ^Jpx' apl }. Using the 
geometric mean to upper bound the minimum, we get that Pr[Pp A Epi] < P^^ 2 {x' atP2 y i ^{x' ap , ) 3 / 4 . 



Thus 



Al = E E E P*[EpAE P ,} 

a&V PSPa P'^PePa 

<p 3/2 EE E «.^ 3/4 ^ 



Cl£V PdPa P' + P(iP a 

2 



<? 3/2 e(e« b ) 3/i V 

a&V \PePa / 




<p 3 / 2 E(-(T) I =^E«J 

3/2 



3/2 



< p 3/2 \/n- p(-J = pVn 



The same analysis can be done for the other type of dependent paths, when they agree on the 

_ p 2 /144 

last edge. Thus A < 2py/n. So Janson's inequality gives us that Pr[Y = 0] < e p/ 12 + 2 pv^. By 
setting p to be larger than 288-v/n + 12 we get that this probability is less than 1/e. 

Case 2: We will use a different style of analysis for the case when at least 1/12 units of flow use 
paths in Vi- Instead of bounding the expectation and using a concentration bound, we will simply 
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show that with constant probability Algorithm || includes a u — > v path. As in the previous case, 
let x' w y denote the actual amount of flow on V2 paths using edge (w,y). Then x' wy < x WtV < x w>y , 
so it suffices to prove that rounding using the x' values works with constant probability. Note that 
for any path P G V2 the middle edge is used only by P, so x' „ 2 = x pi:P2 . We will show that with 
constant probability there is a path P G V2 with T' < px' upi and T p2 < px' v . It is easy to see 
that this is sufficient for all three edges of P to be in E' . 

Let B C V be the set of vertices that are on the last hop of a path in 7^2 > i-e. b G E> if there is 
some P G V2 with P2 = b. Let B' = {b G B : x' bv > 21^} ' 3e ^he set of vertices from B with large 
flow through them, and let V' 2 = {P G V2 ■ P2 £ -B'} be the paths that go through these vertices. 
Since 1/12 flow total is sent on V2 paths, and each vertex not in B' can transport at most l/(24n) 
flow, at least 1/24 flow is sent on paths. Note that we are still in the case where every edge e 
has x e < 1/p, so l/(24n) < x' bv < l/p for all b G B'. For every b G B' let Ef, be the event that 
Tb < P x 'b v an< ^ ^ ^ = YlbeB' be the number of vertices in B for which this event occurs. Then 
^•[Y] = Ylb&B' P x b,v = P^2beB' x b,v > p/24, where the last inequality follows from the fact that at 
least 1/24 flow uses paths in "P 2 - Since these events are all independent, a simple Chernoff bound 
implies that Y > p/48 with probability at least 1 — e _p / 192 . Let B C £?' be the set of vertices in 
£?' for which this even occurs, so with high probability \B\ > p/48. 

Now we want to lower bound the probability that at least one path that passes through B has 
a corresponding first hop that is below the threshold. Let A be the set of first hops for vertices in 
B, so a G A if and only if there is some path P G V'2 and b G B with a = p\ and b = p 2 - It suffices 
for at least one a G A to have < px!^ a , since by the definition of A there is some corresponding 
b G B that completes a path in V^. Note that since \B\ > p/48 and every b G -B has > l/(24n), 
the total amount of flow passing through paths in that use vertices in B as their final hops is 
at least p/48 • (l/(24n)) = /?/(1152n). Thus J2 a eA x 'u,a — p/(1152n), so the probability that no 
a G A has T' a < px' u a is at most ELe^ 1 ~ P x 'u,a) < e~ p ^6A<,a < e -P 2 /(H52n) i Note that here we 
use the fact that that the T' thresholds are independent of the T thresholds, since otherwise this 
probability calculation for A could be dependent on the already chose thresholds of B. 

The total probability that we do not include some length 3 path is at most the sum of the 
probability that \B\ is not large enough and the probability that no corresponding a is selected, 
which is at most e -p / 192 + e~ p /( 1152n ) < l/e as desired. 

Case 3: In the third case there is at least 1/12 units of flow along paths in V3. We will use an 
analysis similar to that of the second case. Let P3 C V 3 be the collection of type 3 paths with large 
first hop capacity: P G P3 if x UjP1 > l/(24n). Obviously at most 1/24 flow can use low capacity 
first hops, so at least 1/24 units of flow uses paths in P3. If P G P3, then because P is type 3 we 
know that a/ px Pl , P2 > P x u,pi and thus £ pliP2 > px\ m > p/(576n). So a sufficient condition for a 
path P G -P3 to be in E' is for T' < px UtPl and for T p2 < \J p 2 j (576n) 2 = p/(24n). 

As before, let B = {b G V : 3P G P 3 with b = p 2 }. For b G B, let x' b v = Ep e p 3:fe=P2 f(P) 
be the amount of flow along paths in P3 that use b as a last hop. So x' bv < Xb >v - Let B' = {b G 
B : x' bv > l/(48n)}. Since at least 1/24 flow is sent using paths in P3, at least 1/48 flow is sent 

using paths in P3 that use a vertex in B' as a last hop. Call this set of paths P3. Now we partition 
B' into classes based on their x' values: let B^ = {b G B' : 1/2* < x' bv < l/2 4 ^ 1 }. Note that the 
first logp of these classes are empty since we are still in a setting where all edges e have x e < l/p, 
and there are at most logn classes by the definition of B' . These classes also partition the paths 
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in Pg (since every path in Pg uses a vertex from B' as a final hop), so at least one class contains 
at least 1/(48 log re) flow. Let Bi be this class. Then every vertex b £ B{ has x' bv > 1/2* and 
l-Bil > (1/(48 logn)/(l/2 i ~ 1 ) = 2 i ~ 1 /( 48 logn) > p/(481ogra). 

Let B C 2?j be the set of vertices b £ Bi with TJ, < p/(24n). The expected number of vertices 
in P is |Pj|p/(24n) > p 2 /(1152n log n). Since /? > Q,(y/nlogn) this becomes O(logn), so a simple 
Chernoff bound suffices to guarantee that \B\ > (1/2)E[|P|] > \Bi\p/(48n) > 2 i - l p/(2304n log n) 
with probability at least 1 — e _e, ( p / nlo § n ) > 1 — e _e ( logn ). Assuming that this occurs, since 
x' b > 1 /2* for all b £ Bi the amount of flow along paths in Pg that use a vertex in B as their last 
hop is at least /o/(4608nlogn). Let A be the set of vertices that are the first hops of these paths, 
i.e. A = {a £ V : 3P £ Pg, b £ B with a = p\ and b = p 2 }. Then J2aeA%u,a > />/(4608n log n) 
in order for there to be enough capacity to shop the flow. Now in order to complete some path 
from Pg we just need for one of these a £ A to have T' a < px u>a - The probability that this does not 
happen is at most \[ a&A {l - px u>a ) < e-^«e^»,» < e -P 2 /4608nlo g n_ 

Thus the total probability of failure is at most 2e~®( p / nlo s n ) (the probability that \B\ is below 
half of its expectation plus the probability that no A vertex has low enough threshold). Since 
p > @{y/nlogn) this becomes at most a constant, satisfying the lemma. 

Case 4: In this case at least 1/12 units of flow use paths in P4. This case is completely analogous 
to case 3, since we did not ever use directionality in our proof of case 3. Since this is the final case, 
it concludes the proof of the lemma. □ 



It is interesting to note that Lemmas 4.1 and 4.2 hold even for a weighted version in which every 



edge has an arbitrary nonnegative cost associated with it and our goal is to find the minimum cost 
3-spanner. So our approximation algorithm actually generalizes to this weighted version. 

Theorem 4.3. There is a polynomial time 0(y/n)- approximation algorithm for the unit-length 
directed 3-spanner problem, even with arbitrary costs on the edges. 

Proof. The algorithm is simple: solve LP (]2|) (note that this can be done exactly since in this 
setting the linear program has only a polynomial number of variable) and then repeat Algorithm 



with fresh randomness O (log re) times. Lemma 4.2 implies that this gives a valid solution with high 



probability, and Lemma [4.1| implies that it is a 0(y / n)-approximation. □ 
4.1 Extension to Fault-Tolerant Version 

It is easy to see that this algorithm can be trivially extended to the r-fault-tolerant setting. For 



each set F of faults, the analysis works the same as in Lemma |L2|. We just need to solve LP (|4j) 
instead of LP (||) and change the parameters (p and the number of times Algorithm || is repeated) 
to make the probability of failure small enough to apply a union bound to all possible failure sets 
F and edges (u, v) instead of just over the edges. In particular, for the vertex failure setting we 
need the probability of failure to be less than l/n r+2 and for the edge failure setting we need to 
probability of failure to be less than 1 /n 2r+2 . The main takeaway is that for directed 3-spanner we 
get fault-tolerance at a cost of only 0(r) instead of something exponential in r, as for the k > 3 
case and previous work on absolute bounds [ CLPR09| . 



Theorem 4.4. For any constant e > there is a polynomial time ( 37^^ > OQ^ryJn\og 2 n))- 
approximation algorithm for the r-fault-tolerant directed 3-spanner problem with unit lengths. There 
is also a true 0(r^/nlog 2 n)- approximation algorithm that takes n ^ time. 
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Proof. Let A be the number of times that Algorithm || is repeated. The analysis of Lemma 4.2 has 
the following failure probabilities for the different cases. If at least 1/3 flow is on the direct edge, 
then the probability of failure is 0. If at least 1/3 flow is on 2-hop paths, then the probability of 
failure is at most e -Ap / 3 . If at least 1/3 flow is on the 3- hop paths, then we have five subcases. For 
the zeroth case, if there is some edge with x e > 1/p then the probability of failure is at most (l/e) A . 
For case 1, the failure probability (i.e. the probability that Y = 0, which we bound by Janson's 
inequality) is at most e - ^'®^/^) . For case 2 the failure probability is at most (e~®^ +e~®( p / n )) A , 
where the first term is from the Chernoff bound guaranteeing that \B\ is large and the second term 
is from the probability that none of the A vertices have low enough thresholds. For the third case, 
the probability of failure is at most (e~ e ( p /( nlo s n )) _|_ e -©(p /(nlogn))^A ; w h ere the first term is from 
the Chernoff bound to guarantee that \B\ is at least half its expectation and the second is from 
the probability that none of the A vertices have low enough thresholds. Finally, the fourth case is 
symmetric to the third. 

It is easy to verify that we set A = C\ (2r + 2) log n and p = C<i y/n log n then all of these failure 
probabilities are less than l/n 2r+2 , which is what we needed for the edge fault setting (we can 
set A = Ci(r + 2) logn for the vertex fault setting). The total approximation that we get is thus 
Ap = 0(r^/nlog 2 n). Combining this with Theorem 2.2 completes the proof. □ 



5 LP-based approximation of 2-spanner 



The 2-spanner problem is qualitatively and quantitatively different from /c-spanner with k > 2: 



it is known [KP94, EP01] that it can be approximated to O(logre) and that this is tight [KorOl] 
(assuming P ^ NP). Note that this approximation algorithm is only known to work for the unit- 
length version. We first show that our LP relaxation (^) has integrality gap of ©(logn), and thus 
offers comparable approximation ratio. We further show that our approach, namely the flow-based 
LP relaxation and a direct rounding procedure, easily adapts to bounded-degree case which was 
studied in the literature [KP94, DK99| , [EP01 |, and also to the fault-tolerant case, for which no 
approximation was previously known. 



5.1 Lower Bound on the Integrality Gap 

We first show that the integrality gap is O(logn). The intuition is that we will apply the hardness 
reduction from set cover to 2-spanner to an instance of set cover that has a large integrality gap. 
We actually prove the gap for the more general setting of undirected graphs; it is easy to see that 
this implies the same gap for directed graphs. We first describe the generic reduction, then the 
particular set cover instance that we apply it to. 

Theorem 5.1. The integrality gap of LP (Q) for undirected unit-length 2-spanner is il(logn) 

Proof. Suppose we have a (unweighted) set cover instance with elements U and sets S, where 
\U\ = N and |<S| = M. We create a graph G with vertex set U U S U {xi : i G [k]}, where k = M 2 . 
In other words, there is a vertex for every element, a vertex for every set, and k new vertices 
xi, . . . ,X)~. Clearly the number of vertices is polynomial in the size of the set cover instance (it is 
in fact n = M 2 + M + N) . There is an edge from every xi to every set node and to every element 
node, an edge between every two set nodes, and an edge between a set node S G S and every 
e G U : e G S. More formally, the edge set is {{xi,S} : i G [k],S G 5} U {{xi,e} : i G [k],e G 
U}U{{S,S'}:S,S'eS}U{{S,e}:seS,eeU,eeS}. 
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The set cover instance that we use has element set F| \ {0}, so there are 2 q — 1 elements. There 
is a set S a for every a G F| (so there are 2 q sets), where S a = {e G \ {0} : a ■ e = 1}. We are 

using the normal notion of dot product over F^, i.e. a ■ e = a\e\ H + a q e q (mod 2). It is easy to 

see that every element is in exactly half of the sets. This large amount of overlap intuitively allows 
the linear program to "cheat". 

To see that the LP has a small solution, we will set the capacity of the edges between set vertices 
to 1, the edges between set vertices and element vertices to 1, and the edges between xi vertices and 
element vertices to 0. We will also set the capacity of edges between xi vertices and set vertices to 
2/M. Obviously this solution has cost at most kM-j^+M 2 + MN = 0(M 2 ) = O(n), so it remains 
to show that it is a feasible solution. To show this, for every edge in the original graph we need 
to find a way to route at least one unit of flow subject to our capacities from one endpoint to the 
other along paths of length at most 2. This is trivial for every edge that we set to have capacity 
1, so we just need to worry about edges incident on x^ nodes. For edges of the form {xj, S} with 
S G S, we can send 1/M flow on every edge from Xi to S (including the edge from Xi to S, and then 
flow that was set to sets S' / S can be forwarded along the {S"^} edge. For edges of the form 
{xi, e} with e G U, we can send 2/M flow from Xi to every set that contains e. Since exactly half 
of the sets contain e this adds up to a total flow of 1. This flow can then be forwarded directly to 
e, since there is an edge of capacity 1 between e and every set containing e. Thus this if a feasible 
solution to the flow LP of cost 0{n). 

Now we want to show than any integral solution has cost at least ri(nlogn). Consider some 
arbitrary integral solution (i.e. a setting of 0/1 capacities to every edge such that one unit of flow 
can be sent between the endpoints of any original edge using paths of length at most 2). Consider 
an edge {xj,e} with e G U. Either this edge has capacity 1, or there is some S G S with e G S 
such that the edges {xi,S} and {S, e} both have capacity 1. This is because the only paths of 
length at most 2 between Xi and e are paths of this form and the one direct edge. Since this is true 
for every e, the vertices adjacent to xi must form a valid set cover of this original instance (where 
an edge directly to an element e is equivalent to adding the set {e}). Thus the degree of any Xi 
node must be at least the size of the smallest valid set cover. For our set cover instance, it is easy 
to see that the size of the smallest cover is at least q. To see this, suppose otherwise, i.e. assume 
there is some collection of sets S ai , . . . , S aq _ 1 that covers the elements. Then nf = lS ai = 0, so 
C\ q i= \{e G F2 : cti ■ e = 0} = {0}. But this is a contradiction, since the intersection of q — 1 
hyperplanes in the g-dimensional vector space over F2 cannot be just a single point (that would 
require at least q hyperplanes). So any valid set cover has size at least q > logiV. 

So now we know that any integral solution to the flow LP has cost at least kq > M 2 log N = 
f2(nlogn), thus proving that the integrality gap of the flow LP is at least f2(logn). □ 



5.2 Upper Bounds via Direct Rounding 

We bound the integrality gap of the LP by randomized rounding of the LP solution. We first show 
a bound of O(logn), and then refine it to 0(log A), where A is the maximum degree of the graph. 
Both results hold also for directed graphs, and easily extend to the client-server version and to the 
augmentation version of [EP01] (where part of the spanner is already given). Finally, we explain 
how these two bounds easily extend to the r fault-tolerant version, losing only a factor of r in the 
approximation guarantee. 
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5.2.1 General digraphs (with unit length edges) 

Theorem 5.2. For the directed 2-spanner problem with unit lengths, (even the version with edge 
costs), the LP relaxation (|2|) has integrality gap O(logn). 

Proof. Consider a directed graph G = (V, E) with edge costs c e > (but unit edge lengths) and 
a feasible solution to LP (Q). The argument for undirected graphs is identical. We employ the 
rounding procedure depicted in Algorithm |3], where C > is a sufficiently large constant to be 
determined later. This is basically a simplified version of the rounding algorithm that we used for 
directed unit-length 3-spanner. 

Algorithm 3: Rounding algorithm for 2-spanner. 

1 Set p = Chin. 

2 For every v G V choose independently a random threshold T v £ [0, 1]. 

3 Output E' = {(u,v) G E : mm{T u ,T v } < p ■ x U)V }. 

The output of this algorithm has expected total cost 
K i'^2 c e\= E Pr[min{r M , T v } < p ■ x UjV ] < 2p) j c e x e . 

eSE' (u,v)eE eGE 

By Markov's inequality, with probability at least 2/3, the cost of the output solution exceeds the 
LP by factor < 3p < O(logn). 

We proceed to show that the output forms a 2-spanner (with high probability); recall that it 
suffices to verify the stretch for edges (u, v) 6 E. So fix (u, v) £ E and consider a u — v path of 
length 2, denoted henceforth P z = (u,z,v). A similar notation but with z = _L refers to a path of 
length 1, i.e. P± = (u,v). By the LP constraints, the sum of flows along all these paths is 

zevu{±} 

Here and in the sequel, notation like z £ V implicitly excludes any z £ V for which there is no such 
path P z = (u,z,v). If fp ± > 1/2, i.e. at least half of this flow is routed directly along the edge 
(u, v), then by the LP constraints x U)V > 1/2 and with probability 1 the edge (n, v) is included in 
E' . Otherwise, X^ev /p« — V^j now observe that whenever T z /p < fp z , the algorithm's output 
E' contains the entire 2-path P z (because by the LP constraints mm{x uz , x zv } > fp z > T z j p). Put 
in the contrapositive, for E' to contain no such 2-path P z , the event {Vz £ V, T z /p > fp z } must 
occur, which happens with probability, 

Pr[Vz £ V, T z /p > f Pz ] = H (1 - p ■ f Pz ) < e -'£«ev fe. < rT C l 2 . (6) 

zev 

Now setting C = 6 and taking a union bound over less than n 2 edges (it, v) £ E, we get that with 
probability at least 1 — 1/n, the algorithm outputs a 2-spanner of G. 

Taking a union bound, we conclude that with probability at least 1/2, the algorithm outputs a 



2-spanner of cost at most the LP value times 0(C log n), proving Theorem 5.2. □ 



5.2.2 Refinement to digraphs of bounded-degree 

Theorem 5.3. For the directed 2-spanner problem with unit lengths on graphs of maximum (in 
and out) degree at most A > 2, the LP relaxation (0) has integrality gap 0(log A). 
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Proof. Suppose now that (in and out) vertex-degrees in G are at most A > 2. Consider the same 
rounding algorithm, except that p = Clog A, and let us show that it succeeds with positive (but 
possibly small) probability. We shall require the following symmetric form of the Lovasz Local 
Lemma (see, e.g., [ ASOOj ]). 



Lemma 5.4 (Lovasz Local Lemma). Let A\, . . . ,A n be events in an arbitrary probability space. 
Suppose that each Ai is mutually independent of all but at most d other events Aj, and suppose 
that Pr[Ai] < p for all 1 < i < n. // ep(d + 1) < 1 then Pr[A™ =1 A~] > 0. 

For an edge (u, v) G E, let A UjV be the event that the output E' contains noii-v path of length 
at most 2. For a given edge (u, v) G E, the same analysis as above shows that 

Pt[A U)V ] < e~ p ^^fp z < A~ c/2 . (7) 

Observe that the event A UjV depends only on the random variables T z for z G N+(u), where N+(u) 
denotes the set of out-neighbors of u in G and u itself (and in our case, N + (u) includes v). Thus, 
A u>v is mutually independent of all events A u ^ v > for which N + {u) n N + (u') = 0, which by a simple 
calculation means it is independent of all but at most (A -)- 1)^ other events A u /^ v /. 

The local lemma applies to these events A UjV , but it guarantees a very small positive probability, 
which is not enough to bound the cost of the solution E' via a union bound. Instead, we incorporate 
the analysis of the cost \E'\ into the local lemma, by "splitting" it into multiple local events. For 
a vertex u G V, define the random variable to be the number of outgoing edges (u, v ) G E for 
which T v < p ■ x UjV . Define Z~ similarly for incoming edges, namely #{(v,u) G E : T v < p • x UjV }. 
Observe that we can bound the cost of the solution by 

\e'\ <EuM z u+Zu), 

by simply "charging" every edge chosen to E' to one of its endpoints. Define the event 
B u = [z+ + Z~ > 4p • (X)(u,«)e£ + Yl{ v ,u)aE x v,u)}- 

We may assume u is incident to at least one edge, say an outgoing edge, and thus v)eE x «,f — 1- 
Since Z+ is the sum of independent indicators with expectation < pYlu v)ee x u,v, we have 



by a Chernoff bound (see e.g. [ MR95H ) 

Pr [Z+ > 2pE( M6S ^] < e-^'^e,*-. < A"^ 4 . 

Similarly, E[Z~] < pJ2(v,u)eE x v,u and thus Pr [Z~ > 2pmax{l, Y,( v> u)eE x v,u}] < A~ c / 4 . For B u 
to occur, at least one of the last two events must occur, hence 

Pr[B u ] < 2A" C/4 . 

Observe further that the event B u depends only on the random variables T z for z G N + (u)LiN_(u). 

We can now apply the local lemma to all the events A U)V and B u . Indeed, each of these events is 
mutually independent of all but at most d = 0(A 2 ) other events, and setting C to be a sufficiently 
large constant and p = 2A~ C ^ yields p(d + 1) < 1/e. Thus, with positive probability none of the 
events A U)V and B u occurs, meaning that E' is a 2-spanner of G and 

\e'\<y,^ z u+ z u)<Y.^-{ E E 8lp. 

«eV u£V v:(u,v)eE v:(v,u)£E 

□ 
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5.2.3 Extension to the fault-tolerant version 

The proofs above easily extends to the fault-tolerant version of the problem, with respect to the 
LP relaxation Q). For concreteness, we discuss vertex- faults, but the same arguments hold for 
edge-faults as well. Recall that r denotes the maximum number of faults. 

The rounding procedure is the same (Algorithm |3|) , except that the factor p is increased by a 
factor of r. For every possible fault set F C V, \F\ < r, and every (u, v) G E\F, we get similarly to 
@ that the probability E'\F does not contain au — v path of stretch 2 is < e~ p ^ z ev fpz < n~ Cr l 2 . 
Since there are at most n r+2 such choices, we can apply a union bound and prove the following 
theorem. 

Theorem 5.5. For the r fault-tolerant version of the 2-spanner problem, even in the versions with 
edge costs and directed graphs, the LP relaxation (||) has integrality gap O(rlogn). 

The proof for bounded-degree graphs is similar: By the same analysis as (R), for a given F and 
(u,v) e E \ F, we get PrL4^J < e~ p ^^^p, < A~ Cr / 2 , and Pr[5f ] < 2A- Cr / 4 . Notice that we 
may restrict this collection of events to cases where all vertices of F are within distance at most 
2 from u, and then each event is mutually independent of all but at most A°( r ) events. Applying 
now the local lemma yields the following theorem. 

Theorem 5.6. For the r -fault-tolerant version of the 2-spanner problem on directed graphs of 
maximum (in and out) degree at most A > 2, the LP relaxation (|j) has integrality gap 0(r log A). 
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